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1.  Introduction 


The  recent  discovery  of  a  hidden  universe  of  tens  of  thousands  of  non-protein-coding  RNAs  is 
revolutionizing  our  thinking  about  the  flow  of  genetic  information  that  shapes  the  complexity  of  the 
human  brain  and  underlies  its  diseases.  A  revised  view  suggests  parallel  processing  from  DNA  to 
protein-coding  mRNAs  as  well  as  non-coding  RNAs  that  are  themselves  active  (without  translation  into 
proteins)  —  with  important  biologic  functions  as  regulators  and  biosensors. 

We  hypothesize  that  in  complex  neurodegenerative  diseases  such  as  Parkinson’s  genetic  and 
epigenetic  risks  (from  aging  and  environmental  exposures)  exert  their  disease-causing  effects  in  part 
through  a  concerted  modulation  of  transcriptional  programs  that  include  non-coding  and  protein¬ 
coding  RNA  functions.  The  aims  of  this  pilot  study  were:  1 ,  Establishing  a  catalog  of  tens  of  thousands 
previously  unexplored,  non-coding  RNAs  and  assess  their  association  with  Parkinson’s  precisely  in 
the  preferentially  vulnerable  cell-type  —  dopaminergic  neurons  laser-captured  from  human  substantia 
nigra  —  using  massively  parallel  sequencing  and  unlimited  ab  initio  reconstruction  of  the  whole 
transcriptome  2,  Beginning  to  delineate  the  genetic  variability  that  underlies  differences  in  non-coding 
RNA  abundance  and  processing  in  human  nigral  dopamine  neurons  using  transcriptome  genetics.  3, 
Evaluating  PD-relevant  transcripts  as  potential  biomarkers  in  a  pilot  study  of  blood  samples  from 
patients  with  PD  and  controls  enrolled  in  the  Harvard  Biomarker  Study. 

In  this  study  we  were  able  to  provide  a  first  look  at  the  “dark  universe”  of  thousands  of  short 
and  long  non-coding  RNAs  expressed  in  the  dopamine  neuron  of  normal  brains  and  in  Parkinson’s 
disease.  Moreover,  the  study  begins  to  clarify  the  role  of  genetic  variants  in  dopamine  neurons  and  to 
translate  transcripts  associated  with  PD-neuropathology.  Finally,  we  provide  proof  of  the  concept  that 
select  RNAs  associated  with  Parkinson’s  in  vulnerable  dopamine  neurons  can  be  translated  into 
personalized,  dual-use,  peripheral  biomarkers.  Validated  biomarkers  will  be  clinically  relevant  for 
identifying  PD-vulnerable  warfigthers  and  civilians,  and  for  monitoring  neurotoxin  exposure  and 
therapeutic  response. 

2.  Keywords 

Parkinson’s  disease,  non-coding  RNA,  RNA  sequencing,  expression  Quantitative  Trait  Locus, 
biomarkers 
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3.  Overall  project  summary 


Aim  1.  Delineating  non-coding  RNAs  associated  with  Parkinson’s  in  nigral  dopamine  neurons 
se  (Fig.  1) 
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Fig.  1.  Non-coding  RNAs  in  dopamine  neurons  of  human  brain  are  associated  with  Parkinson’s  disease. 


33%  of  all  unique  RNAs  robustly  detected  in  nigral  dopamine  neurons  are  long  or  short  non-protein- 
coding  RNAs  (ncRNAs)  (Fig.  la).  In  control  brains,  8,493  ncRNAs  many  of  which  had  never  before 
been  described  in  human  brain  were  identified  by  ab  initio  transcriptome  reconstruction  (Fig.  1b, 
stacked  colored  bar).  These  include  short  ncRNAs  (e.g.  17  snoRNAs  and  11  snRNAs)  as  well  as 
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thousands  of  long,  regulatory,  ncRNAs  (e.g.  2,846  pseudogenes,  2,806  lincRNAs,  and  2,454  natural 
antisense  RNAs. 

121  non-coding  RNAs  (Fig.  1c,  d)  were  statistically  significantly  differentially  expressed  in 
dopamine  neurons  laser-captured  from  substantia  nigra  pars  compacta  of  19  patients  with  PD 
compared  to  62  controls  without  PD  with  genome-wide  significance  (adjusted  for  multiple  testing  using 
the  false  discovery  rate  method)  and  adjusting  for  the  pertinent  covariates  of  age,  gender,  RIN  number, 
postmortem  interval,  and  batch).  These  PD-associated  non-coding  RNAs  included  77  expressed 
pseudogenes,  22  lincRNAs,  17  antisense  transcripts,  as  well  as  sense-intronic,  sense-overlapping, 
and  other  non-coding  RNAs.  Moreover,  we  identified  1,178  protein-coding  mRNAs  significantly  over- 
or  under-expressed  in  dopamine  neurons  captured  from  brains  of  patients  with  PD  (Fig.  1c,  d).  The 
scatter  plot  in  Fig.  Id  visualizes  non-coding  and  coding  RNAs  more  than  three-fold  differentially 
expressed  in  dopamine  neurons  of  patients  with  PD  compared  to  controls.  Non-coding  RNAs  are 
shown  in  magenta.  The  lincRNA,  LINC00674  and  the  expressed  pseudogene  UQRBP1  are 
highlighted  in  Fig.  Id  and  in  the  box  plots  in  Fig.  1e.  Moreover,  expression  changes  in  two  genes, 
which  when  mutated  cause  familial  PD,  LRRK2  and  DNAJC6 1,2  (Fig.  1  d,  e)  were  significant  in 
dopamine  neurons  of  patients  with  PD  compared  to  controls. 


Expression  of  the  dopamine  neuron-specific  ALDH1A1  gene  is  lost  in  neurons  from  patients 
with  Parkinson’s  (Fig.  2). 


Fig.  2.  Expression  of  the  dopamine  neuron-specific  ALDH1A1  gene  (a)  is  lost  in  dopamine  neurons  from  patients 
with  PD  (b). 


We  compared  the  RNA  content  of  dopamine  neurons  and  pyramidal  neurons  laser-captured  from 
control  brains.  Variance-  and  sex-  adjusted  differential  expression  analysis  was  performed.  115  genes 
were  exclusively  transcribed  in  dopamine  neurons  and  not  in  pyramidal  neurons  with  P  <  10"6.  Fig.  2a 
shows  the  top-ranked  22  dopamine  neuron-specific  genes  (black)  and  the  top  5  pyramidal  neuron- 
specific  genes  (magenta).  Confirming  the  cell-type  precision  of  our  method,  these  dopamine-neuron 
specific  transcripts  included  the  established  parts  of  the  dopamine  metabolism  machinery,  the 
vesicular  monoamine  transporter  2  ( VMAT2 ),  dopamine  receptor  D2  ( DRD2 ),  tyrosine  hydroxylase 
(77-/),  and  dopamine  decarboxylase  (DDC).  Surprisingly,  the  by  far  most  abundant  dopamine  neuron- 
specific  transcript  however  was  Aldehyde  Dehydrogenase  1A1  (ALDH1A1).  ALDH1A1  was 
significantly  underexpressed  in  dopamine  neurons  of  cases  with  PD  compared  to  dopamine  neurons 
of  controls  (P  =  1 0'5;  Fig.  2b).  ALDH1A1  mediates  the  oxidation  of  toxic  dopamine-3, 4- 
dihydroxyphenylacetaldehyde  (DOPAL)  into  non-toxic  3,4-dihydroxyphenylacetic  acid  (DOPAC).  In 
mice  ALDH1A1  overexpression  suppresses  A53T-a-synuclein-induced  dopaminergic 
neurodegeneration  and  ALDH1A1  knockout  enhances  dopaminergic  neurodegeneration  in  model 
mice  3.  ALDH1A1  expression  might  also  be  reduced  in  blood  of  patients  with  PD  4. 


Aim  2.  Delineating  the  genetic  variants  regulating  non-coding  RNAs  in  nigral  dopamine 
neurons.  In  order  to  elucidate  the  relationship  between  genetic  variation  and  gene  expression,  we 
have  developed  a  strategy  by  combining  PEER5  and  Matrix-eQTL6  to  conduct  expression  quantitative 
trait  loci  (eQTL)  analysis.  In  brief,  we  normalized  the  gene  expression  by  adjusting  both  known  and 
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hidden  factors  learnt  from  the  Bayesian  framework  PEER,  then  used  Matrix-eQTL  to  call  significant 

cis-e QTL  on  the  residuals 
of  all  genes  for  both 
genotyped  and  imputed 
SNPs. 

Expression  of 

several  genes  was 
associated  with  multiple  cis- 
acting  SNPs  (eSNPs).  The 
expression  of  118  genes 
was  linked  to  5,024  genetic 
variants  with  a  false 
discovery  rate  (FDR)  <  0.05. 
These  “e-genes”  included 
47  protein-coding  genes 
and  a  surprising  number  of 
non-coding  genes,  including 
30  expressed  pseudogenes,  18  antisense  genes,  and  15  lincRNAs.  2,602  eSNPs  were  associated 
with  RP1 1-707023.5  expression,  a  pseudogene  of  MAPK8IP1  (mitogen-activated  protein  kinase  8 
interacting  protein  1).  51  eSNPs  overlapped  with  PD  GWAS  risk  SNPs  in  the  MAPT  susceptibility 
locus.  Importantly,  eQTL  analysis  suggests  that  these  variants  (although  physically  located  in  the 
proximity  of  the  MAPT  locus)  do  not  regulate  expression  of  the  MAPT  gene,  but  instead  of  three  other 
genes  physically  located  nearby  (i.e.  RP1 1-707023.5  with  FDRs  as  low  as  1.08'4,  KANSL1-AS1  with 
FDR  of  0.01 3,  and  LRRC37A2  with  FDR  of  0.0014). 

When  looking  at  the  regression  of  gene  expression  on  genotype,  we  observed  distinct  patterns 
relating  minor  allele  dosage  to  expression.  For  example,  expression  levels  of  the  lincRNA  AP000350.5 
negatively  correlated  with  minor  allele  dosage  of  eSNP  rs5760099  (Fig.  3a),  suggesting  a  repressive 
effect  of  the  minor  allele.  By  contrast,  minor  allele  dosage  of  the  genetic  variant  rs7955007  was 
associated  with  increased  expression  of  RP1 1-726G1 .1  (a  processed  transcript  that  overlaps  with  a 
piRNA;  Fig.  3b),  suggesting  an  activating,  additive  effect.  We  also  observed  eSNP-gene  expression 
pairs  with  an  apparently  dominant,  dosage-independent  relation  (Fig.  3c).  For  example,  expression 
levels  of  the  antisense  transcript  TMLHE-AS1  showed  a  binary  increase  associated  with  the  presence 
(vs.  absence)  of  the  minor  allele  for  rs73569697,  but  not  with  its  dosage. 

Aim  3.  Evaluating  PD  neuropathology-linked  RNAs  as  personalized  blood  biomarkers 


AP000350.5  vs.  rs5760099 


B  RP11-726G1 .1  vs.  TS7955007 


TMLHE-AS1  vs.  rs73569697 


Fig.  3.  Genetic  variants  regulate  non-coding  RNAs  specifically  in  dopamine  neurons. 


Gene 

Mean  mRNA  counts  in 
blood  of  patients  with 

PD  (SD) 

Mean  mRNA  counts  in 
blood  of  healthy 
controls  (SD) 

Difference  of 
means  in  blood 
(%) 

P-value* 

(in  blood) 

P-value# 

(in  dopamine 
neurons) 

SNCA 

2,117  (1,732) 

3,042  (2,030) 

-  30.4% 

3.8x10  15 

0.02 

LRRK2 

10,531  (3,451) 

9,352  (3,159) 

12.6% 

4.7x1 0"08 

0.001 

CHMP3 

2,259  (495) 

2,135  (442) 

5.8% 

4.5x10'°5 

0.01 

COMT 

390  (81) 

405  (97) 

-  3.7% 

0.02 

0.002 

*,  P  value  derived  from  generalized  linear  models  analysis  adjusting  for  age  at  baseline,  gender,  RNA  Integrity  Number,  white  blood 
cell  counts,  red  blood  cell  counts,  and  platelet  counts.  *,  P  value  comparing  transcript  counts  in  dopamine  neurons  of  cases  vs. 
controls  adjusting  for  age,  gender,  RNA  Integrity  Number,  and  batch  number. 


We  evaluated  the  feasibility  of  translating  transcripts  associated  with  PD  neuropathology  in  dopamine 
neurons  of  human  brain  (from  Aim  1)  into  potentially  clinically  useful  biological  markers  detectable  in 
peripheral  blood. 

We  performed  a  cross-sectional  study  using  blood  samples  from  cases  representing  untreated 
de  novo  PD  as  well  as  treated  early-stage  PD  and  age-  and  sex-matched  healthy  controls  enrolled  in 
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the  Harvard  Biomarker  Study  (HBS)7'12  —  a  study  of  >2,500  participants  with  detailed  clinical 
phenotypes  and  longitudinal  biospecimens  collection  co-founded  and  co-directed  by  Dr.  Scherzer.  PD 
patients  met  stringent  UK  PD  Brain  Bank  Society’s  (UKPDBBS)  diagnostic  criteria  and  were  at  an 
early-stage  of  the  disease  as  defined  by  a  Modified  Hoehn  &  Yahr  stage  <  3  assessed  by  a  movement 
disorders  specialist.  This  single-center  study  followed  stringent  rules  of  evidence  for  molecular  marker 

discovery  and  validation 


SNCA-3UTR-2 


LRRK2 


CHMP3 


COMT 


Fig.  4.  Four  transcripts  that  were  abnormally  expressed  in  dopamine  neurons  from  brains 
of  patients  with  PD  were  initially  translated  into  potential  blood  biomarkers.  mRNA  counts 
in  peripheral  blood  of  controls  (HC),  untreated,  de  novo  PD  (d-PD),  and  PD  patients  on  treatment 
(t-PD)  are  shown.  Error  bars,  standard  error. 


and  was  made  possible  by 
our  existing  collection  of 
high-quality  RNA  already 
extracted  from  blood 
samples  of  >2,500 
individuals  enrolled  in  HBS. 

Excitingly,  in  this  proof-of- 
concept  pilot  we  were 
already  able  to  translate 
four  transcripts  that  are 
pathologically  expressed  in 
laser-captured  dopamine 
neurons  of  patients  with  PD 
into  peripheral,  easily 
accessible  possible  blood  biomarkers.  These  transcripts  are  a-synuclein  ( SNCA )  and  LRRK2,  COMT, 
and  CHMP3  (Table  1  and  Fig.  4).  These  four  transcripts  were  statistically  significantly  differentially 
expressed  in  dopamine  neurons  of  patients  with  PD  compared  to  controls  based  on  our  laser-capture, 
total  RNAseq  method  (from  Aim  1;  Table  1)  as  well  as  in  peripheral  blood  samples  from  the  Harvard 
Biomarker  Study  (HBS)  on  the  precise  Nanostring  platform  (Fig.  4).  The  P  values  for  associations  in 
blood  as  well  as  in  dopamine  neurons  are  shown  in  Table  1.  Importantly,  two  of  these  four  transcripts 
are  also  genetically  linked  to  familial  and  sporadic  PD:  a-synuclein  (SNCA)  and  LRRK2. 

The  SNCA  transcript  was  further  evaluated  in  neuroimaging-supported,  early  stage  PD  on  the 
qPCR,  microarray,  and  NanoString  platform  in  the  PROBE  and  PPMI  study,  in  addition  to  HBS.  We 
found  that  SNCA  transcripts  in  circulating  blood  cells  were  reduced  in  early  stage,  untreated,  and 
dopamine  transporter  neuroimaging-supported  Parkinson's  disease  (PD)  in  three  independent 
regional,  national,  and  international  populations  representing  500  cases  and  363  controls  and  on  three 
analog  and  digital  platforms  with  P  =  0.0001  in  meta-analysis.  Individuals  with  SNCA  transcripts  in  the 
lowest  quartile  of  counts  had  an  odds  ratio  for  PD  of  2.45  compared  to  individuals  in  the  highest 
quartile.  Disease-relevant  transcript  isoforms  were  low  even  near  disease  onset.  Importantly,  low 
SNCA  transcript  abundance  predicted  cognitive  decline  in  patients  with  Parkinson’s  during  up  to  five 
years  of  longitudinal  follow-up.  These  important  findings  on  SNCA  were  published  in  the  prestigious 
journal  Brain  in  August  of  2015  (Locascio  et  al.,  Brain,  2015;  the  reprint  is  attached  in  the  Appendix). 

The  other  three  transcripts  -  LRRK2,  COMT,  and  CHMP3  -  significantly  associated  with  early 
stage  PD  in  blood  (Fig.  4)  are  also  directly  relevant  to  the  neuropathology  with  significant  expression 
changes  in  dopamine  neurons  (Table  1).  LRRK2  is  genetically  strongly  implicated  in  both  familial  and 
sporadic  PD.  COMT  transcripts  encode  catechol-O-methyl-transferase,  an  essential  gene  in  the 
metabolism  of  L-dopa.  Finally,  the  vacuolar  Protein  Sorting  24  Homolog  (CHMP3)  was  pathologically 
expressed  in  our  RNAseq  analysis  of  dopamine  neurons  from  patients  with  PD.  It  is  a  novel  PD- 
associated  transcript,  to  our  knowledge  previously  not  linked  to  Parkinson’s.  Interestingly,  the  gene 
CHMP3  encodes  a  protein  that  sorts  transmembrane  proteins  into  lysosomes/vacuoles  via  the 
multivesicular  body  pathway.  It  is  part  of  endosomal  sorting  complexes  required  for  transport  (ESCRT). 
This  pathway  is  implicated  in  the  pathogenesis  of  sporadic  PD13. 

This  study  provides  proof  of  the  concept  that  select  RNAs  associated  with  Parkinson’s  in 
vulnerable  dopamine  neurons  can  be  translated  into  personalized,  dual-use,  peripheral  biomarkers. 
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4.  Key  research  accomplishments 

•  This  study  provides  the  first  comprehensive  catalog  of  gene  expression  -  both  protein-coding 
and  non-coding  -  in  a  specific  cell  type,  not  in  cultured  cells  or  mice,  but  in  situ  in  human  brain. 

•  Going  beyond  traditional  mRNA  sequencing,  all  polyadenylated  and  non-polyadenylated 
transcripts  over  50  bases  were  ultra  deeply  sequenced  using  ribo-depleted  RNA  from 
dopamine  neurons  laser-captured  from  human  post-mortem  brains  of  patients  with  PD  and 
controls  using  a  new  method  we  developed,  termed  laserRNAseq. 

•  A  vast  universe  of  annotated  as  well  as  novel  non-coding  RNAs  expressed  in  dopamine 
neurons  was  discovered  suggesting  a  more  diverse  and  much  more  complex  transcriptional 
architecture  than  previously  imagined. 

•  The  expression  of  121  non-coding  transcripts  and  more  than  a  thousand  protein-coding 
transcripts  in  nigral  dopamine  neurons  was  significantly  associated  with  Parkinson’s  Lewy 
body  neuropathology. 

•  Expression  of  the  gene  ALDH1A1  that  plays  a  critical  role  in  scavenging  toxic  dopamine 
metabolites  is  lost  in  dopaminergic  neurons  laser-captured  from  patients  with  PD 

•  This  study  provides  proof  of  the  concept  that  select  RNAs  associated  with  Parkinson’s  in 
vulnerable  dopamine  neurons  can  be  translated  into  personalized,  dual-use,  peripheral 
biomarkers.  Specifically,  we  were  able  to  translate  four  transcripts  ( SNCA ,  LRRK2,  COMT  and 
CHMP3)  that  were  abnormally  transcribed  in  laser-captured  dopamine  neurons  of  PD  brains 
into  potential  blood  biomarkers  for  patients  with  Parkinson’s. 

5.  Conclusion 

This  study  provides  a  first  look  at  the  “dark  universe”  of  tens  of  thousands  of  short  and  long  non¬ 
coding  RNAs  expressed  in  the  human  dopamine  neuron  of  controls  and  in  Parkinson’s  disease.  We 
show  unequivocally  that  non-coding  RNAs  are  abundantly  transcribed  in  dopamine  neurons  of  human 
brain.  We  identify  highly  significant  changes  in  the  expression  of  both  protein-coding  and  non-coding 
transcripts  in  Parkinson’s  disease.  These  changes  indicate  key  roles  for  non-coding  RNAs  and  for  the 
protein-coding  familial  PD  gene  DNAJC6  and  ALDH1A1  in  the  survival  of  dopamine  neurons  in 
Parkinson’s.  ALDH1A1  mediates  the  oxidation  of  toxic  dopamine-3, 4-dihydroxyphenylacetaldehyde 
(DOPAL)  into  non-toxic  3,4-dihydroxyphenylacetic  acid  (DOPAC).  In  mice  ALDH1A1  overexpression 
suppresses  A53T-a-synuclein-induced  dopaminergic  neurodegeneration  and  ALDH1A1  knockout 
enhances  dopaminergic  neurodegeneration3. 

By  translating  four  transcripts  (that  were  found  to  be  pathologically  expressed  in  dopamine 
neurons  of  patients  with  PD)  into  potential  biomarkers  accessible  in  blood  of  patients  and  controls,  we 
provide  proof  of  concept  for  the  idea  that  the  activity  of  select  PD-relevant  genes  can  be  precisely 
monitored  in  blood  tests.  In  support  of  this  view  we  show  robust  evidence  indicating  significant 
associations  between  SNCA,  LRRK2,  COMT,  and  CHMP3  mRNA  counts  in  blood  and  Parkinson’s 
using  precise  digital  gene  expression  analysis  with  molecular  barcodes  on  the  NanoString  platform. 
Additionally,  previous  reports  indicate  that  ALDH1A1  expression  might  also  be  reduced  in  blood  of 
patients  with  PD4. 

In  future  steps,  we  will  continue  to  delineate  all  regions  of  the  human  genome  actively 
transcribed  in  dopamine  neurons.  We  will  fully  decode  all  non-coding  RNAs  active  in  dopamine 
neurons.  We  will  extend  our  investigation  from  known  non-coding  RNAs  annotated  in  the  GENECODE 
database  to  the  characterization  of  thousand  of  novel,  previously  not  described  non-coding  RNAs 
emerging  from  these  analyses  (such  as  novel  transcribed  enhancers  or  eRNAs  and  new  classes  of 
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RNA  biotypes).  We  will  increase  statistical  power  and  scientific  knowledge  by  extending  the  current 
observations  to  larger  numbers  of  high-quality  brains,  to  pre-clinical  Lewy  body  neuropathology,  and  to 
additional  brain  cell-types  (e.g.  pyramidal  cells).  Genetically,  as  our  sample  size  becomes  larger  we 
will  be  increasingly  powered  to  uncover  how  genetic  risk  variants  causally  contribute  to  the 
degeneration  of  dopamine  neurons  via  dysregulation  of  cell-type-specific  transcription.  Clinically,  we 
will  continue  to  translate  PD-associated  transcripts  into  peripheral  tests  in  increasingly  large8  and 
definitive  studies.  Moreover,  we  aim  to  assemble  validated  transcriptional  and  genetic  markers  into 
polygenic  test  panels  with  improved  sensitivity  and  specificity.  Validated  biomarkers  will  be  clinically 
relevant  for  identifying  PD-vulnerable  warfigthers  and  civilians,  and  for  monitoring  neurotoxin  exposure 
and  therapeutic  response  in  clinical  trials  and  eventually,  clinical  practice. 


10 


6.  Publications  since  submission  of  the  original  application  (chronologic  order;  *,  denotes 
corresponding  or  co-corresponding  author) 

*1.  Unrecognized  vitamin  D3  deficiency  is  common  in  Parkinson’s:  Harvard  Biomarker  Study.  Ding  H, 
Dhima  K,  Locascio  J,  Sohur  US,  Wills  A-M,  Flaherty  AW,  Unni  VK,  Hung  AY,  Selkoe  DJ, 
Schwarzschild  MA,  Schlossmacher  MG,  Sudarsky  LR,  Growdon  JH,  Ivinson  AJ,  Hyman  BT, 
Scherzer  CR*.  Neurology,  2013;  81:  1531-7.  Perspective  in  Nature  Reviews  Neurology,  2014;  10: 
8-9.  doi:  10.1212/01.wnl.0000449750.81263.7d.  PMID:  24799519.  (Impact  Factor  =  8). 

2.  Specific  splice  variants  are  associated  with  Parkinson’s  disease.  JA  Santiago,  Scherzer  CR, 

Harvard  Biomarker  Study,  and  JA  Potashkin.  Mov  Disord,  2013;  28:1724-7.  doi: 

10.1 002/mds. 25635.  PMID:  24108702.  (Impact  Factor  =  5.7) 

3.  A  novel  method  for  detecting  7-methyl  guanine  reveals  aberrant  methylation  levels  in  Huntington 
disease.  Thomas  B,  Matson  S,  Chopra  V,  Sun  L,  Sharma  S,  Hersch  S,  Rosas  HD,  Scherzer  CR, 
Ferrante  R,  Matson  W.  Anal  Biochem.  2013;  436:  112-20.  doi:  10.1016/j.ab.2013.01.035. 
PMID:23416183. 

*4.  Metallothioneins  as  dynamic  markers  for  brain  disease  in  lysosomal  disorders.  Cesani  M,  Cavalca 
E,  Macco  R,  Leoncini  G,  Terreni  MR,  Lorioli  L,  Furlan  R,  Comi  G,  Doglioni  C,  Zacchetti  D,  Sessa  M, 
Scherzer  CR*,  Biffi  A*,  Ann  of  Neurol,  2014;  75:  127-37.  doi:  10.1002/ana. 24053. 
PMID:24242821.  (Impact  Factor  =  10) 

5.  Motor  impulsivity  in  Parkinson  disease:  Associations  with  COMT  and  DRD2  polymorphisms. 
Ziegler  DA,  Ashourian  P,  Wonderlick  JS,  Sarokhan  AK,  Prelec  D,  Scherzer  CR,  Corkin  S.  Motor 
impulsivity  in  Parkinson  disease:  Associations  with  COMT  and  DRD2  polymorphisms,  Scand  J 
Psychol ..  2014;  55:278-86.  doi:  10.1 1 1 1/sjop. 121 13.PMID:24749760. 

6.  MHC-I  expression  renders  catecholaminergic  neurons  susceptible  to  T-cell-mediated  degeneration. 
Cebrian  C,  Zucca  FA,  Mauri  P,  Steinbeck  JA,  Studer  L,  Scherzer  CR,  Kanter  E,  Budhu  S, 
Mandelbaum  J,  Vonsattel  JP,  Luigi  Zecca,  Loike  JD,  Sulzer  D.  Nat  Commun,  2014;  5:  3633.  doi: 
10.1038/ncomms4633.  PMID:24736453.  (Impact  Factor  =  11.5) 

7.  High  prevalence  of  NMDA  receptor  IgA/IgM  antibodies  in  different  dementia  types.  Doss  S, 
Wandinger  KP,  Hyman  BT,  Panzer  JA,  Synofzik  M,  Dickerson  B,  Mollenhauer  B,  Scherzer  CR, 
Ivinson  AJ,  Finke  C,  Schols  L,  Muller  Vom  Hagen  J,  Trenkwalder  C,  Jahn  H,  Holtje  M,  Biswal  BB, 
Harms  L,  Ruprecht  K,  Buchert  R,  Hoglinger  GU,  Oertel  WH,  Unger  MM,  Kortvelyessy  P,  Bittner  D, 
Priller  J,  Spruth  EJ,  Paul  F,  Meisel  A,  Lynch  DR,  Dirnagl  U,  Endres  M,  Teegen  B,  Probst  C, 
Komorowski  L,  Stocker  W,  Dalmau  J,  Pruss  H.  Ann  Clin  Trans I  Neurol.  2014;  1:  822-32.  doi: 
10.1002/acn3.120.  PMID:25493273. 

8.  Network  analysis  identifies  SOD2  mRNA  as  a  potential  biomarker  for  Parkinson's  disease.  Santiago 
JA,  Scherzer  CR,  Potashkin  JA.  PLoS  One.  2014;  9:  e109042.  doi:  10. 1371/journal. pone. 0109042. 
eCollection  2014.  PMID:2527975. 

*9.  Association  between  a-synuclein  blood  transcripts  and  early,  neuroimaging-supported  Parkinson's 
disease.  Locascio  J  J,  Eberly  S,  Liao  Z,  Liu  G,  Hoesing  AN,  Duong  K,  Trisini-Lipsanopoulos  A, 
Dhima  K,  Hung  AY,  Flaherty  AW,  Schwarzschild  MA,  Hayes  MT,  Wills  AM,  Shivraj  Sohur  U,  Mejia 
Nl,  Selkoe  DJ,  Oakes  D,  Shoulson  I,  Dong  X,  Marek  K,  Zheng  B,  Ivinson  A,  Hyman  BT,  Growdon 
JH,  Sudarsky  LR,  Schlossmacher  MG,  Ravina  B,  Scherzer  CR.  Brain.  2015  Jul  28.  pii:  awv202. 
[Epub  ahead  of  print]  PMID:  26220939.  (Impact  Factor  =  9) 


11 


Presentations 


2013  Can  BIG  DATA  solve  big  questions? 

Movement  Disorders  Conference,  Department  of  Neurology 
Massachusetts  General  Hospital 
Boston,  MA 

2014  Can  BIG  data  help  neurological  disease? 

Massachusetts  General  Hospital,  J.  Philip  Kistler  Stroke  Center  Meeting 
Massachusetts  General  Hospital,  Boston,  MA 

2014  From  BIG  data  to  personalized  medicine  for  Parkinson’s 
Icahn  School  of  Medicine  at  Mount  Sinai 
New  York,  NY 

2014  Mapping  transcriptome  space  in  Parkinson's:  from  the  known  world  to  an  expanding 
universe 

Molecular  Pathology  Seminar 

Johns  Hopkins  University,  Baltimore,  MD 

2014  The  PD  Brain  Map:  How  does  the  human  genome  function  in  dopamine  neurons? 
NIH/NINDS  Parkinson’s  Disease  Genetics  and  Systems  Biology  Workshop 
Bethesda,  MD 

2014  Human  genome  function  in  Parkinson’s 
Neurology  Department,  Invited  Lecture 
Goethe  University,  Frankfurt,  Germany 

2015  Parkinson’s  disease:  How  Big  Data  will  enable  a  future  personal  medicine 
Genzyme 

Framingham,  MA 

Abstracts 

2013  The  PD  Brain  Map  Project:  Mapping  the  Transcriptional  Architecture  of  Dopamine 
Neurons  in  Human  Brain 

Annual  Meeting  of  the  American  Society  for  Human  Genetics 
Boston,  MA 

2015  Functional  characterization  of  the  human  brain  transcriptome  in  Parkinson’s  disease 
European  Molecular  Biology  Laboratory  Meeting 
Heidelberg,  Germany 

2015  BRAINCODE:  How  Does  the  Human  Genome  Function  in  Specific  Brain  Neurons? 
Annual  Meeting  of  the  American  Society  for  Human  Genetics 
Baltimore,  MD 

7.  Inventions,  Patents  and  Licenses  -  Nothing  to  report 

8.  Reportable  Outcomes  -  Nothing  to  report 


12 


9.  Other  Achievements 


2015  -  Scientific  Advisory  Board,  American  Parkinson  Disease  Association 

2015  Neurotoxin  Exposure  Treatment  PD  Vision  Setting  Panel,  U.S.  Department  of  Defense 

10.  References 

1.  Bras  JM,  Singleton  A.  Genetic  susceptibility  in  Parkinson's  disease.  Biochim  Biophys  Acta 
2009;1792:597-603. 

2.  Singleton  AB,  Farrer  MJ,  Bonifati  V.  The  genetics  of  Parkinson's  disease:  progress  and 
therapeutic  implications.  Mov  Disord  2013;28:14-23. 

3.  Liu  G,  Yu  J,  Ding  J,  et  al.  Aldehyde  dehydrogenase  1  defines  and  protects  a  nigrostriatal 
dopaminergic  neuron  subpopulation.  J  Clin  Invest  2014. 

4.  Molochnikov  L,  Rabey  JM,  Dobronevsky  E,  et  al.  A  molecular  signature  in  blood  identifies  early 
Parkinson's  disease.  Mol  Neurodegener  2012;7:26. 

5.  Stegle  O,  Parts  L,  Piipari  M,  Winn  J,  Durbin  R.  Using  probabilistic  estimation  of  expression 
residuals  (PEER)  to  obtain  increased  power  and  interpretability  of  gene  expression  analyses.  Nat 
Protoc  2012;7:500-7. 

6.  Shabalin  AA.  Matrix  eQTL:  ultra  fast  eQTL  analysis  via  large  matrix  operations.  Bioinformatics 
2012;28:1353-8. 

7.  Mohammadi  D.  The  Harvard  Biomarker  Study's  big  plan.  Lancet  Neurol  2013;12:739-40. 

8.  Locascio  J  J,  Eberly  S,  Liao  Z,  et  al.  Association  between  alpha-synuclein  blood  transcripts  and 
early,  neuroimaging-supported  Parkinson's  disease.  Brain  2015. 

9.  Ziegler  DA,  Ashourian  P,  Wonderlick  JS,  et  al.  Motor  impulsivity  in  Parkinson  disease: 
associations  with  COMT  and  DRD2  polymorphisms.  Scand  J  Psychol  2014;55:278-86. 

10.  Ding  H,  Dhima  K,  Lockhart  KC,  et  al.  Unrecognized  vitamin  D3  deficiency  is  common  in 
Parkinson  disease:  Harvard  Biomarker  Study.  Neurology  2013. 

11.  Pankratz  N,  Beecham  GW,  DeStefano  AL,  et  al.  Meta-analysis  of  Parkinson's  disease: 
identification  of  a  novel  locus,  RIT2.  Ann  Neurol  2012;71:370-84. 

12.  Ding  H,  Sarokhan  AK,  Roderick  SS,  et  al.  Association  of  SNCA  with  Parkinson:  replication  in  the 
Harvard  NeuroDiscovery  Center  Biomarker  Study.  Mov  Disord  201 1  ;26:2283-6. 

13.  Kurashige  T,  Takahashi  T,  Yamazaki  Y,  et  al.  Localization  of  CHMP2B-immunoreactivity  in  the 
brainstem  of  Lewy  body  disease.  Neuropathology  2013;33:237-45. 


11.  Appendix 

Association  between  a-synuclein  blood  transcripts  and  early,  neuroimaging-supported  Parkinson's 
disease 

Locascio  J  J,  Eberly  S,  Liao  Z,  Liu  G,  Hoesing  AN,  Duong  K,  Trisini-Lipsanopoulos  A,  Dhima  K,  Hung 
AY,  Flaherty  AW,  Schwarzschild  MA,  Hayes  MT,  Wills  AM,  Shivraj  Sohur  U,  Mejia  Nl,  Selkoe  DJ, 
Oakes  D,  Shoulson  I,  Dong  X,  Marek  K,  Zheng  B,  Ivinson  A,  Hyman  BT,  Growdon  JH,  Sudarsky  LR, 
Schlossmacher  MG,  Ravina  B,  and  Scherzer  CR. 

Brain.  2015  Jul  28.  pii:  awv202.  [Epub  ahead  of  print] 


13 


Brain  Advance  Access  published  July  28,  2015 


Association  between  a-synudein  blood 
transcripts  and  early,  neuroimaging-supported 
Parkinson’s  disease 

Joseph  J.  Locascio,1’2’*  Shirley  Eberly,3’*  Zhixiang  Liao,1’4’*  Ganqiang  Liu,1’4 

Ashley  N.  Hoesing,1’4’5  Karen  Duong,1’4’5  Ana  Trisini-Lipsanopoulos, 1,4,5  Kaltra  Dhima, 1,4,5 

Albert  Y.  Hung,2  Alice  W.  Flaherty,2’6  Michael  A.  Schwarzschild,2  Michael  T.  Hayes,7 

Anne-Marie  Wills,2’5  U.  Shivraj  Sohur,2  Nicte  I.  Mejia,2  Dennis  J.  Selkoe,4’7  David  Oakes,3 

Ira  Shoulson,8  Xianjun  Dong,1’4  Ken  Marek,9  Bin  Zheng,1’4  Adrian  Ivinson,4’5 

Bradley  T.  Hyman,  ’  John  H.  Growdon,  Lewis  R.  Sudarsky,  Michael  G.  Schlossmacher, 

Bernard  Ravina11  and  Clemens  R.  Scherzer1’2’4’5’7 


*  These  authors  contributed  equally  to  this  work. 


There  are  no  cures  for  neurodegenerative  diseases  and  this  is  partially  due  to  the  difficulty  of  monitoring  pathogenic  molecules  in 
patients  during  life.  The  Parkinson’s  disease  gene  a-synuclein  ( SNCA )  is  selectively  expressed  in  blood  cells  and  neurons.  Here  we 
show  that  SNCA  transcripts  in  circulating  blood  cells  are  paradoxically  reduced  in  early  stage,  untreated  and  dopamine  transporter 
neuroimaging-supported  Parkinson’s  disease  in  three  independent  regional,  national,  and  international  populations  representing 
500  cases  and  363  controls  and  on  three  analogue  and  digital  platforms  with  P  <  0.0001  in  meta-analysis.  Individuals  with  SNCA 
transcripts  in  the  lowest  quartile  of  counts  had  an  odds  ratio  for  Parkinson’s  disease  of  2.45  compared  to  individuals  in  the  highest 
quartile.  Disease-relevant  transcript  isoforms  were  low  even  near  disease  onset.  Importantly,  low  SNCA  transcript  abundance 
predicted  cognitive  decline  in  patients  with  Parkinson’s  disease  during  up  to  5  years  of  longitudinal  follow-up.  This  study  reveals 
a  consistent  association  of  reduced  SNCA  transcripts  in  accessible  peripheral  blood  and  early-stage  Parkinson’s  disease  in  863 
participants  and  suggests  a  clinical  role  as  potential  predictor  of  cognitive  decline.  Moreover,  the  three  independent  biobank 
cohorts  provide  a  generally  useful  platform  for  rapidly  validating  any  biological  marker  of  this  common  disease. 
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Introduction 

Nearly  5  million  people  have  Parkinson’s  disease,  a  spor¬ 
adic,  progressive  neurodegenerative  disorder  linked  to  a 
complex  genetic  architecture  and  environmental  exposures. 
Over  the  next  15  years,  this  number  will  almost  double  to 
9  million  patients  worldwide  (Dorsey  et  al.,  2007).  Already 
today,  the  costs  to  public  health  are  enormous — estimated 
at  $10.8  billion  annually  in  the  USA  alone  (O’Brien  et  al., 
2009).  There  are  no  cures  or  disease-modifying  therapies 
for  Parkinson’s  disease,  and  this  may  be  due  in  part  to  our 
inability  to  monitor  biological  markers  of  early  disease. 
Currently,  patients  with  Parkinson’s  disease  are  diagnosed, 
cared  for,  and  assigned  to  clinical  trials  based  on  medical 
history  and  physical  exam. 

To  translate  research  progress  into  the  clinic,  simple,  dy¬ 
namic  biomarkers  are  needed  that  are  rooted  in  genetic 
discoveries.  Traditionally,  Parkinson’s  disease  has  been  ap¬ 
proached  from  a  clinical  perspective,  as  a  homogeneous 
movement  disorder  typified  by  the  triad  of  resting  tremor, 
bradykinesia  and  rigidity.  Based  on  this  clinical  view  of  the 
disease,  it  was  postulated  that  a  single  laboratory  or  ima¬ 
ging  biomarker  for  Parkinson’s  disease  would  be  developed 
to  serve  as  a  universal  diagnostic.  This  goal  has  been  elu¬ 
sive.  More  recently,  a  human  genome-inspired  view  of 
Parkinson’s  disease  has  provided  unequivocal  evidence  for 
a  diversity  of  genetic  variants,  each  exacting  a  small,  but 
definitive  effect,  underlying  the  clinical  phenotype  (Nalls 
et  al.,  2014).  It  is  becoming  clearer  that  Parkinson’s  disease 
is  aetiologically,  genetically,  pathologically  and  even  clinic¬ 
ally  more  heterogeneous  than  previously  implied  (Scherzer 
and  Feany,  2004;  Scherzer  et  al.,  2004;  Forman  et  al., 
2005).  In  light  of  this  diversity  the  traditional  expectation 
of  a  ‘one  size  fits  all’  laboratory  diagnostic  that  can  detect 
all  patients  and  answer  all  clinical  questions  may  need  to  be 
revised.  Personalized  biomarkers,  tailored  to  patient  endo- 
phenotypes  and  specific  clinical  applications,  are  likely  to 
emerge.  They  may  more  appropriately  describe  the  molecu¬ 
lar  architecture  of  the  disease. 

What  are  the  practical  applications  that  such  biomarkers 
may  serve,  if  they  cannot  be  used  as  a  universal  diagnostic? 
Prognostic  and  mechanism-directed  biomarkers  will  help  to 
stratify  patients  for  clinical  trials.  Risk  markers  are  required 
for  identifying  high-risk  individuals  prior  to  the  onset  of 
overt  clinical  symptoms,  when  disease-modifying  interven¬ 
tions  will  be  most  effective.  Pharmacodynamic  markers  are 
required  to  determine  whether  an  investigational  drug 


engages  the  intended  target.  In  cancer  drug  development, 
biomarker  use  has  increased  average  phase  III  trial  success 
by  34%  (Hayashi  et  al.,  2013)  and  shortened  total  drug 
development  timelines  (Chin  et  al.,  2011).  Emerging  CSF 
tests  for  Parkinson’s  disease  (Mollenhauer  et  al.,  2011)  and 
dopamine  transporter  imaging  are  relatively  invasive  or 
time-consuming  and  expensive.  Blood-based  tests  are  at¬ 
tractive,  but  no  disease-causing  molecule  has  been  rigor¬ 
ously  validated. 

Accumulation  of  a-synuclein  protein  (encoded  by  the 
SNCA  gene)  in  the  brain  is  a  hallmark  of  Parkinson’s  dis¬ 
ease  (Braak  et  al.,  2002).  For  nearly  two  decades  since  its 
characterization  as  an  electric  ray  protein,  the  Parkinson’s 
disease  gene  a-synuclein  was  thought  of  as  ‘neuron-specific’ 
(Maroteaux  et  al.,  1988),  but  this  view  was  disproven  with 
the  discovery  of  high  levels  of  a-synuclein  expression  in 
blood  cells,  particularly  those  of  erythroid  lineage  (Nakai 
et  al.,  2007;  Scherzer  et  al.,  2008;  Maitta  et  al.,  2011). 
Here  we  tested  whether  levels  of  SNCA  transcripts 
(mRNAs)  in  circulating  blood  cells  might  serve  to  identify 
individuals  with  increased  risk  of  Parkinson’s  disease.  We 
evaluated  whether  levels  of  a-synuclein  gene  (SNCA)  tran¬ 
scripts  are  associated  with  the  presence  of  early-stage,  clin¬ 
ically  and  neuroradiologically  confirmed  Parkinson’s 
disease  in  a  platform  of  three  independent  biomarker 
study  cohorts. 

Materials  and  methods 

Brief  methods  are  summarized  below.  See  the  online 
Supplementary  material  for  detailed  clinical,  laboratory  and 
statistical  experimental  methods  for  the  Harvard  Biomarker 
Study  (HBS),  Prognostic  Biomarkers  in  Parkinson’s  Disease 
Study  (PROBE)  and  Parkinson’s  Progression  Markers 
Initiative  (PPMI),  respectively,  and  for  the  meta-analysis 
across  three  studies. 

Biospecimen  collections 

Briefly,  the  studies  were  designed  to  minimize  bias  from  sample 
processing  by  collecting,  handling  and  analysing  specimens  of 
cases  and  controls  in  a  standardized  manner  following  defined 
operating  procedures  and  according  to  rules  of  evidence 
(Ransohoff,  2004,  2005;  Hennecke  and  Scherzer,  2008). 
Cases  and  controls  were  processed  in  parallel  by  technicians 
blinded  to  diagnosis  to  avoid  bias  due  to  ‘run  order’  of  sam¬ 
ples.  Venous  whole  blood  was  collected  in  PAXgene™  tubes. 
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Blood  and  PAXgene™  reagents  were  mixed  by  gently  invert¬ 
ing  8-10  times. 

RNA  isolation  and  quality  control 

RNA  was  extracted  according  to  the  PAXgene™  Blood  RNA 
kit  manual  extraction  protocol  including  DNase  treatment. 
RNA  quality  was  determined  with  the  RNA  6000  Nano  Chip 
on  an  Agilent  2100  Bioanalyzer  (Agilent  Technologies)  and  the 
RNA  Integrity  Number  package  (Imbeaud  et  al.,  2005). 

Quantitative  PCR  and  microarray 
expression  analyses 

Quantitative  PCR  was  performed  similar  to  Scherzer  et  al. 
(2008).  Microarray  procedures  were  performed  as  described 
(Zheng  et  al.,  2010)  using  Illumina  HumanHT-12v3 
Expression  BeadChips.  The  PROBE  microarray  data  set  has 
been  submitted  to  the  GEO  database  (accession  number 
GSE57475). 

NanoString  assay  performance 

Probes  were  designed  according  to  the  manufacturer’s  design 
principles  (Geiss  et  al.,  2008),  including  screening  for  inter- 
and  intra-reporter  and  capture  probe  interactions,  and  selec¬ 
tion  for  probes  with  optimal  melting  temperatures  (Geiss  et  al., 
2008).  Direct  counts  of  the  target  RNAs  were  measured  in 
125  ng  of  RNA  by  digital  expression  analysis  based  on 
NanoString  technology  (without  reverse  transcription  into 
cDNA).  Probes  for  the  target  and  control  RNAs  were  multi¬ 
plexed  and  assayed  on  the  nCounter  Digital  Analyzer.  The 
laboratory  running  the  assay  was  blinded  to  the  diagnosis. 
No-template  (negative)  controls  containing  water  substituted 
for  template  were  run  and  no  signal  was  detected.  To  avoid 
run-order  bias,  samples  of  cases  or  controls  were  randomly 
assigned  to  plates.  To  control  for  plate-to-plate  variation  and 
drift,  equal  amounts  of  Human  Universal  Reference  RNA  were 
included  at  the  beginning,  within,  and  at  the  end  of  the  entire 
experiment.  Transcript  counts  (excluding  SNCA- 007)  mea¬ 
sured  in  reference  RNA  were  highly  correlated  with 
R2  >  0.999  both  within  one  plate  and  in-between  different 
plates,  thus  excluding  drift  as  a  potential  source  of  bias  in 
the  experiment.  Furthermore,  5%  of  participants’  samples 
were  randomly  resampled  to  verify  the  retest  reliability  (tech¬ 
nical  precision).  The  average  R2  was  0.98  for  these  correl¬ 
ations.  The  NanoString  data  are  accessible  through  the  PPMI 
website  (http://www.ppmi-info.org). 

Results 

Harvard  Biomarker  Study:  SNCA 
mRNA  abundance  in  early-stage 
Parkinson’s  disease 

We  first  evaluated  relative  SNCA  mRNA  abundance  in  a 
case-control  study  nested  in  the  HBS  using  precise,  kinetic, 
quantitative  PCR  based  on  fluorogenic  5’  nuclease  chemis¬ 
try  (quantitative  PCR).  We  specifically  designed  the  HBS  as 


Table  I  The  HBS:  patient  characteristics 


Patients  Controls  P-value 
with 

Parkinson’s 

disease 

(n  =  222)  (n  =  1 83) 


Age  at  phlebotomy 

67.05 

(9.53) 

68.44  (10.23) 

(years,  SD) 

Gender 

Male  ( n ,  %) 

130 

(58.6) 

74  (40.4) 

Female  (n,  %) 

92 

(41.4) 

109  (59.6) 

Clinical  findings  (n,  %) 

UKPDBBS3 

207 

(93.2) 

Resting  tremor 

164 

(73.87) 

Bradykinesia 

221 

(99.59) 

Rigidity 

219 

(98.65) 

Postural  imbalance 

109 

(49.10) 

Asymmetric  onset 

175 

(78.83) 

Medications  (n,  %) 

Levodopa 

170 

(76.58) 

Dopamine  replacement 

201 

(90.5) 

medications 

De  novo 

21 

(9.5%) 

Disease  severity 

Disease  duration 

4.8 

(4.5) 

(years)  (mean,  SD) 

Hoehn  and  Yahr  (mean,  SD) 

2.1 

(0.6) 

UPDRS  Part  1:  mentation, 

1.5 

(1.5) 

behaviour,  mood 

UPDRS  Part  II:  ADLs 

8.7 

(5.2) 

UPDRS  Part  III:  motor 

18.5 

(9.0) 

examination 

UPDRS  total  score 

30.9 

(13.6) 

aUnited  Kingdom  Parkinson’s  Disease  Brain  Bank  Society  diagnostic  criteria. 
ADL  =  activities  of  daily  living. 


a  clinical  biomarker  study  with  rigorous,  predefined  collec¬ 
tion  and  processing  protocols.  Cases  and  controls  had  simi¬ 
lar  ages,  but  patients  with  Parkinson’s  disease  were  more 
likely  to  be  males  (Table  1).  Cases  were  at  an  early  stage  of 
the  disease,  with  a  mean  modified  Hoehn  and  Yahr  stage  of 
2.1.  A  large  majority  of  cases  (201  of  222,  90.5%)  were  on 
medications  that  ameliorate  the  dopamine  deficiency  caused 
by  the  degeneration  of  neurons  in  the  substantia  nigra, 
while  9.5%  (21  of  222)  were  untreated,  de  novo  patients. 
The  controls  were  recruited  from  the  same  source  popula¬ 
tion.  Case  and  control  samples  were  collected,  processed, 
and  analysed  in  parallel.  Samples  were  required  to  meet 
stringent  quality  control  criteria  in  order  to  enter  the 
study  including  a  RNA  Integrity  Number  (RIN)  (Auer 
et  al.,  2003)  threshold  of  ^7.3  indicating  high  RNA  qual¬ 
ity.  The  two  groups  had  excellent  RNA  quality  with  mean 
RINs  of  8  ±  0.4  versus  8.05  ±  0.4  [mean  ±  standard  devi¬ 
ation  (SD)],  respectively. 

Relative  SNCA  transcript  abundance  in  venous  blood 
was  significantly  lower  for  patients  with  Parkinson’s  disease 
than  for  controls  (difference  of  20%;  P-value  =  0.0004; 
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Table  2  Blood  SNCA  expression  is  associated  with  early-stage  clinical  and  DAT-neuroimaging  supported  Parkinson’s 
disease  in  three  independent  populations  and  on  three  independent  assay  platforms 


Cases 

Controls 

Difference  of  means 

P-value* 

HBS 

Quantitative  PCR  platform0 

n  (total  n  =  405) 

Unadjusted  relative  SNCA  expression  (mean,  SD) 

222 

51.5  (1.97) 

183 

64.2  (1.8) 

— 20e 

0.0004 

Adjusted  relative  SNCA  expression01  (mean,  95%  Cl) 

49  (45-54) 

61  (55-68) 

—  I7e 

0.003 

PROBE 

Quantitative  PCR  platform0 

n  (total  n  =  118) 

Unadjusted  relative  SNCA  expression  (mean,  SD) 

76 

75.6  (2.27) 

42 

96.3  (1.7) 

— 22e 

0.025 

Adjusted  relative  SNCA  expression01  (mean,  95%  Cl) 

72  (61-85) 

96  (76-122) 

— 25e 

0.046 

Microarray  platform 

n  (total  n  -  1 42) 

ILMN-PROBE  la  Unadjusted  SNCA  expression  (mean,  SD) 

93 

2846  (1962) 

49 

3469  (1622) 

-18 

0.01 

ILMN-PROBEI3  Adjusted  SNCA  expressiond  (mean,  95%  Cl) 

2452  (2167-2775) 

3098  (2635-3644) 

-21 

0.02 

ILMN-PROBE2b  Unadjusted  SNCA  expression  (mean,  SD) 

681  (473) 

823  (432) 

-17 

0.02 

ILMN-PROBE2a  Adjusted  SNCA  expressiond  (mean,  95%  Cl) 

578  (508-657) 

730  (616-864) 

-21 

0.03 

PPMI 

Digital  gene  expression  platform 

n  (total  n  =  340) 

202 

138 

Unadjusted  analyses 

E3E4-SNCA  transcript  counts  (mean,  SD) 

13  458  (7812) 

15959  (12576) 

-16 

0.08 

E4E6-SNCA  transcript  counts  (mean,  SD) 

2826  (I860) 

3496  (3,1  19) 

-19 

0.04 

Long  3’UTR-SNCA  transcript  counts  (mean,  SD) 

573  (387) 

781  (1013) 

-27 

0.007 

SNCA-007  transcript  counts  (mean,  SD) 

23  (16) 

25  (21) 

-8 

0.6 

Adjusted  modelsd 

Adjusted  E4E6-SNCA  transcript  counts  (mean,  95%  Cl) 

2409  (2196-2643) 

2784  (2495-3107) 

-13 

0.046 

Adjusted  long  3’UTR-SNCA  transcript  counts  (mean,  95%  Cl) 

485  (440-534) 

578  (515-647) 

-16 

0.02 

aILMN-PROBEI  indicates  lllumina  probe  ILMNI70I933  that  targets  the  3'UTR  of  SNCA. 
bILMN-PROBE2  indicates  lllumina  probe  ILMNI766I65  that  targets  exon  5  of  SNCA. 

CAII  statistical  analyses  for  quantitative  PCR  assays  were  conducted  using  the  natural  logarithm  of  ACT  values.  For  presentation  purposes  in  Table  2  and  Figs  I  and  2,  a  linearizing 
transformation  of  mean  ACTs  was  performed.  Values  shown  are  in  arbitrary  units. 
dAdjusted  for  covariates  of  gender,  and  white  and  red  blood  cell  counts. 

indicates  per  cent  reduction  in  expression  in  cases  compared  to  controls  determined  using  the  comparative  CT  method, 
indicates  significance  analysis  for  all  measurement  values  was  performed  on  log-transformed  data. 
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Table  2  and  Fig.  1A).  Post  hoc  analysis  of  the  subgroup  of 
untreated,  de  novo  cases  showed  consistent  results  with  a 
>20%  lower  mean  relative  SNCA  transcript  abundance  in 
the  de  novo  cases  compared  to  the  controls  (P- 
value  =  0.01).  Counts  of  the  four  endogenous  reference 
genes  (Fig.  IB)  and  their  geometric  mean  (that  was  used 
to  normalize  for  input  RNA)  were  virtually  identical  in 
cases  and  controls  (Fig.  1C).  Importantly,  complete  blood 
counts  in  cases  and  controls  could  not  account  for  the  re¬ 
duction  in  relative  SNCA  mRNA  abundance  in  patients 
consistent  with  previous  reports  by  others  (Kasten  et  al., 
2010).  Specifically,  white  blood  cell  counts  (Fig.  ID)  and 
platelet  counts  (Fig.  IF)  were  unchanged  between  cases  and 
controls,  while  red  blood  cell  counts  were  marginally 
higher  in  cases  than  in  controls  (P  =  0.03;  Fig.  IE). 

General  linear  model  analysis  was  performed  adjusting 
for  the  covariates  of  counts  of  white  and  red  blood  cells, 
and  gender.  In  this  covariate-adjusted  analysis  the  mean 
relative  abundance  of  SNCA  expression  was  17%  lower 
in  cases  than  in  controls  with  P  =  0.003. 


Odds  ratios  (OR)  for  the  lowest  (first)  quartile  relative  to 
the  highest  (fourth)  quartile  for  blood  SNCA  transcript 
abundance  were  associated  with  Parkinson’s  disease  status 
in  the  unadjusted  analysis  and  remained  associated  follow¬ 
ing  adjustments  for  the  covariates  of  age,  hours  at  4°C,  red 
and  white  blood  cells,  and  platelets  (Model  1)  or  for  gender 
(Model  2)  with  odds  ratios  ranging  from  2.14  (95%  Cl, 
1.10-4.14)  to  2.15  (95%  Cl,  1.2-3.82)  (Table  3). 

PROBE  study 

Replication  of  low  SNCA  mRNA  abundance  in 
dopamine  transporter  imaging-supported 
Parkinson’s  disease 

To  be  of  use  in  clinical  trials,  it  must  be  feasible  to  measure 
a  biomarker  in  a  multicentre  study  design.  We  thus  further 
evaluated  SNCA  mRNA  levels  in  the  USA-wide,  multicen¬ 
tre  PROBE  study  (Supplementary  Table  1).  There  were 
no  significant  baseline  inequalities  for  age,  gender,  and 
white  and  red  blood  cell  counts.  Cases  had  a  mean 
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Figure  I  HBS:  reduced  SNCA  mRNA  abundance  in  early-stage  Parkinson’s  disease.  (A)  Mean  SNCA  expression  was  significantly 
lower  in  222  patients  with  early-stage  clinical  Parkinson’s  disease  (PD)  compared  to  183  controls  (HC)  without  neurologic  disease  enrolled  in  the 
HBS  using  precise,  quantitative  PCR  (unadjusted  difference  of  20%;  P-value  of  0.0004).  *P-value  =  0.01  for  untreated,  de  novo  cases  versus 
controls.  Box  plots  visualize  first,  third  quartiles  and  means;  the  ends  of  the  whiskers  represent  the  lowest  (or  highest)  value  still  within  1 .5-times 
the  interquartile  range.  (B)  Expression  values  of  the  four  endogenous  reference  genes  used  in  the  HBS  were  virtually  identical  in  cases  and 
controls.  Cycle  threshold  (CT)  values  (standard  error)  are  shown  for  cases  (crimson  bars)  and  controls  (grey  bars).  (C)  The  geometric  mean  of 
these  four  endogenous  reference  genes  was  used  to  normalize  for  input  RNA.  (D-F)  Importantly,  complete  blood  counts  in  cases  and  controls 
could  not  account  for  the  reduction  in  relative  SNCA  mRNA  abundance  in  patients.  Specifically,  white  blood  cell  counts  (D,  WBC)  and  platelet 
counts  (F)  did  not  significantly  differ  between  cases  and  controls,  while  red  blood  cell  counts  (RBC)  were  marginally  higher  in  cases  than  in 
controls  (E;  mean  red  blood  cell,  4.6  versus  4.5,  P  =  0.03).  Bar  graphs  show  means  and  standard  errors. 


Hoehn  and  Yahr  stage  of  2.0,  a  mean  total  UPDRS  score  of 
30.6,  and  mean  disease  duration  of  5.5  years.  The  clinical 
diagnosis  was  supported  by  dopamine  transporter  (DAT) 
imaging  with  123I-(3-CIT.  A  subset  of  76  cases  with 
Parkinson’s  disease  and  42  controls  with  available  high- 
quality  RNA  were  analysed  by  quantitative  PCR.  The 
mean  relative  abundance  of  SNCA  expression  was  22% 
lower  in  cases  than  in  controls  with  P  =  0.025  in  the  un¬ 
adjusted  analysis  (Table  2  and  Fig.  2A).  This  remained  sig¬ 
nificant  after  adjusting  for  the  covariates  of  white  and  red 
blood  cells,  and  gender  with  P  =  0.046.  Quantitative  PCR 
allows  for  precise  relative,  but  not  absolute  quantification. 
Thus,  the  difference  in  SNCA  mRNA  abundance  in  cases 
relative  to  controls  found  in  PROBE  was  comparable  to  the 
20%  difference  seen  in  HBS  (i.e.  in  Figs  1A  and  2A),  while 
the  absolute  expression  values  (in  arbitrary  units)  were  not 
directly  comparable  due  to  different  calibrators. 

We  confirmed  these  expression  changes  using  two  SNCA 
probes  represented  on  a  secondary,  microarray  expression 
platform.  Results  for  both  probes  were  available  for  93 


cases  and  49  controls  enrolled  in  PROBE  and  confirmed 
that  relative  blood  SNCA  expression  was  lower  in  cases 
compared  to  controls  (Table  2  and  Fig.  2B  and  C). 
Results  for  ILMN-PROBE1  confirmed  an  association  be¬ 
tween  SNCA  expression  and  Parkinson’s  disease  adjusting 
for  the  covariates  of  white  and  red  blood  cell  counts,  and 
gender  with  P  =  0.02.  The  association  was  further  con¬ 
firmed  by  ILMN-PROBE2  with  P  =  0.03,  again  adjusting 
for  the  same  covariates. 

Individuals  with  SNCA  transcript  abundance  in  the 
lowest  quartile  of  values  had  an  OR  for  Parkinson’s  disease 
of  4.5  (95%  Cl,  1.3-15.0)  compared  to  individuals  in 
the  highest  quartile  of  values  (Table  3).  The  corres¬ 
ponding  odds  ratios  for  the  two  microarray  probes  were 
4.4  (95%  Cl,  1.4-14.5)  and  4.6  (95%  Cl,  1.5-13.7), 
respectively.  Thus,  measuring  blood  SNCA  expression 
was  feasible  in  a  multicentre  study  and  reduced  blood 
SNCA  expression  in  Parkinson’s  disease  was  confirmed  in 
this  independent  population  on  two  distinct  analysis 
platforms. 
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Table  3  The  odds  ratio  for  Parkinson’s  disease  prevalence  is  increased  in  individuals  with  blood  SNCA  expression  in 
the  lowest  quartile  of  values  compared  to  individuals  with  SNCA  expression  in  the  highest  quartile  of  values  in  three 
study  populations 


Quartile 

i 

2 

3 

4 

HBS 

Relative  SNCA  mRNA  abundance 

5-3  7a 

37-60a 

60-9 1 a 

9 1 -294a 

Parkinson’s  disease  (n  =  222),  nl total  (%) 

67/222  (30) 

58/222  (26) 

50/222  (23) 

47/222  (21) 

Healthy  control  (n  =  1 83),  nl  total  (%) 

34/183  (18.6) 

43/183  (23.5) 

51/183  (27.9) 

55/183  (30) 

Unadjusted  OR,  (95%  Cl)b 

2.31  (1.31— 4.1) 

1.58  (0.91-2.75) 

1.15  (0.66-1.99) 

1 

P- value 

0.0036 

0.1  1 

0.63 

Model  1,  OR  (95%  Cl)c 

2.14  (1. 10-4.14) 

1.86  (1.01-3.42) 

1.07  (0.6-1.92) 

1 

P- value 

0.024 

0.046 

0.82 

Model  2,  OR  (95%  Cl)d 

2.15  (1.2-3.82) 

1.61  (0.92-2.83) 

1.12  (0.64-1.96) 

1 

P- value 

0.0096 

0.098 

0.69 

PROBE® 

Relative  SNCA  mRNA  abundance 

1 6-52a 

56-79a 

84-1 I9a 

1 1 9-2702a 

Parkinson’s  disease  ( n  =  76),  n  /total  (%) 

24/76  (32) 

20/76  (26) 

17/76  (22) 

15/76  (20) 

Healthy  control  (n  =  42),  nl  total  (%) 

5/42  (12) 

10/42  (24) 

13/42  (31) 

14/42  (33) 

OR  (95%  Cl)w 

4.5  (1.3-15.0) 

1 .9  (0.7-5. 3) 

1.2  (0.4-3. 4) 

1 

P- value 

0.02 

0.90 

0.25 

PPMI 

Long-3’UTR-SNCA  transcript  counts 

59-328 

335-497 

505-786 

796-10  861 

Parkinson’s  disease  ( n  =  202),  n/total  (%) 

58/202  (29) 

48/202  (24) 

55/202  (27) 

41/202  (20) 

Healthy  control  (n  =  1 38),  n/total  (%) 

27/138  (19.6) 

37/138  (26.8) 

30/138  (21.7) 

44/138  (31.9) 

Unadjusted  OR,  95%  Cl)b 

2.3  (1. 2-4.3) 

1 .4  (0.8-2.5) 

1.97  (1. 1-3.6) 

1 

P- value 

0.009 

0.28 

0.03 

Adjusted  OR  (95%  Cl)g 

2.1  (1. 1-4.0) 

1 .4  (0.8-2.6) 

2.1  (1. 1-3.9) 

1 

P- value 

0.02 

0.25 

0.02 

aAII  statistical  analyses  for  quantitative  PCR  assays  were  conducted  using  ACT  values.  For  presentation  purposes  the  ranges  of  each  quartile  of  values  shown  were  linearly 
transformed,  e.g.  2'lowest  ACT  and  rhighest  ACT. 

bUnadjusted  odds  ratio  (OR)  and  confidence  intervals  (Cl)  were  calculated  using  logistic  regression  models  comparing  the  risk  of  each  of  quartiles  I  to  3  to  the  reference  quartile  4 
(highest  relative  SNCA  transcript  abundance). 

cModel  I  consists  of  covariates  age,  hours  in  4°C,  red  blood  cells,  platelets,  and  white  blood  cells  in  addition  to  the  SNCA  quartiles. 
dModel  2  consists  of  covariate  sex  in  addition  to  the  SNCA  quartiles. 

ePROBE  results  on  quantitative  PCR  platform  are  shown,  similar  results  were  obtained  on  the  microarray  platform. 
fNo  adjustments  were  indicated. 

8Model  consists  of  the  covariate  white  blood  cells  in  addition  to  the  quartiles  of  long  3’UTR-SNCA  counts. 


PPMI 

Reduced  abundance  of  disease-relevant  SNCA 
isoforms  already  near  disease  onset 

These  associations  prompted  us  to  further  extend  and 
verify  our  findings  in  patients  near  disease  onset,  when  in¬ 
dividual  motor  symptoms  are  just  beginning  to  manifest. 
Two  hundred  and  two  cases  with  de  novo  motor 
Parkinson’s  disease,  with  less  than  2  years  of  disease  and 
with  confirmed  loss  of  dopaminergic  substantia  nigra  ter¬ 
minals  by  DAT  imaging  and  138  age-  and  sex-matched 
controls  without  neurologic  disease  and  with  normal 
DAT  imaging,  who  met  stringent  RNA  quality-control  cri¬ 
teria  (RIN  ^  7.3)  were  included.  Key  eligibility  criteria  for 
PPMI  require  the  presence  of  asymmetric  resting  tremor  or 
asymmetric  bradykinesia,  a  supportive  DAT  scan,  and  a 
diagnosis  of  Parkinson’s  disease  for  2  years  or  less,  without 
current  dopamine  replacement  medication  (Parkinson 
Progression  Marker  Initiative,  2011).  Patients  therefore 


were  clinically  at  a  very  early  stage  and  did  not  yet  neces¬ 
sarily  fulfil  the  standard,  full  clinical  diagnostic  criteria  of 
the  UK  Parkinson’s  Disease  Society  Brain  Bank  that  require 
bradykinesia  and  a  second  mandatory  clinical  cardinal  sign 
and  three  supportive  clinical  criteria  to  be  present  for  diag¬ 
nosis  (Hughes  et  al.,  1992).  Cases  and  controls  were  fre¬ 
quency  matched  for  age  and  gender  (Supplementary  Table 
2).  The  PPMI  population  was  remarkable  for  the  virtual 
absence  of  Parkinson’s  disease  medications,  and  the  equal 
distribution  of  potentially  confounding  variables  of  age, 
gender,  and  blood  cell  counts  among  cases  and  controls 
(Supplementary  Table  2  and  Fig.  3G-I). 

The  advent  of  digital  expression  analysis  with  molecular 
barcodes  (Geiss  et  al.,  2008;  Maikov  et  al.,  2009)  allowed 
us  to  count  the  abundance  of  SNCA  transcripts  directly  in 
RNA  from  blood  cells,  without  the  need  for  reverse  tran¬ 
scription  or  PCR  amplification.  No  signal  was  detected  in 
controls  lacking  template.  To  avoid  run-order  bias,  samples 
of  cases  or  controls  were  randomly  assigned  to  plates. 
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Figure  2  USA-wide,  multicentre  PROBE  study:  replication  of  low  SNCA  mRNA  abundance  in  DAT  imaging-confirmed 
Parkinson’s  disease.  (A)  We  further  evaluated  this  association  in  76  cases  with  DAT  imaging-confirmed  Parkinson’s  disease  and  42  controls 
enrolled  in  the  PROBE  study.  A  22%  reduction  in  levels  of  blood-SNCA  expression  was  seen  in  cases  compared  to  controls  on  the  quantitative 
PCR  platform  (P  =  0.025).  (B  and  C)  These  results  were  technically  replicated  on  a  second,  independent  microarray  expression  platform  that 
includes  two  distinct  probes  for  SNCA.  Results  for  each  of  the  two  probes  confirmed  that  relative  blood-SNCA  expression  was  lower  in  cases 
compared  to  controls.  Box  plots  visualize  first,  third  quartiles  and  means;  the  ends  of  the  whiskers  represent  the  lowest  (or  highest)  value  still 
within  1.5-times  the  interquartile  range. 


Transcript  counts  (excluding  SNCA- 007)  measured  in  ref¬ 
erence  RNA  were  highly  correlated  with  R2  >  0.999  both 
within  one  plate  and  in-between  different  plates,  thus 
excluding  drift  as  a  potential  source  of  bias.  Furthermore, 
5%  of  participants’  samples  were  randomly  resampled  to 
verify  the  retest  reliability  (technical  precision).  The  average 
R2  was  0.98  for  these  correlations. 

Alternative  SNCA  transcript  isoform  usage  has  been  sug¬ 
gested  as  a  potential  convergent  mechanism  in  Parkinson’s 
disease  pathology  (Rhinn  et  al.,  20f2).  We  therefore  de¬ 
signed  multiple  probes  (Supplementary  Fig.  1)  to  target 
the  boundaries  of  exon  3  and  exon  4  (henceforth  termed 
E3E4-SNCA  transcript  isoform),  transcripts  specifically 
with  a  long  3’UTR  region  (which  may  redirect  a-synuclein 
towards  mitochondria;  Rhinn  et  al.,  2012),  transcripts  that 
skip  exon  5  (termed  E4E6-SNCA  transcript  isoform)  and 
are  translated  into  a  truncated,  aggregation-prone  112 
amino  acid  protein  (Murray  et  al.,  2003),  or  the  rare 
short  SNCA- 007  transcript  isoform  that  comprises  exons 
1-4.  Relative  counts  of  long  3’UTR-SNCA  transcripts 
were  reduced  by  27%  in  cases  with  Parkinson’s  disease 
compared  to  controls  in  the  unadjusted  analysis  with 
P  =  0.007  (Table  2  and  Fig.  3A).  This  was  confirmed 
with  a  second  digital  molecular  probe  directed  at  a  different 
target  sequence  in  long-3’UTR-SNCA  transcripts.  The  rela¬ 
tion  between  long-3’UTR-SNCA  transcripts  and 
Parkinson’s  disease  remained  significant  with  a  P-value  of 
0.02  after  adjusting  for  the  covariates  of  white  blood  cell, 
red  blood  cell,  and  gender  (Table  2).  Counts  of  exon  5- 
skipping  transcripts  were  also  associated  with  Parkinson’s 
disease  in  unadjusted  and  adjusted  analyses  with  P-values 
of  0.04  and  0.046,  respectively  (Table  2  and  Fig.  3B). 
Counts  for  E3E4-SNCA  transcripts  were  16%  lower  in 
cases  compared  to  controls  but  did  not  reach  significance. 
Counts  of  the  rare  SNCA-007  transcript  did  not  signifi¬ 
cantly  differ  between  cases  and  controls.  To  estimate 


whether  SNCA  expression  is  specially  associated  with 
Parkinson’s  disease  or  whether  Parkinson’s  disease-linked 
transcripts  are  generally  perturbed  in  circulating  blood 
cells  of  patients  with  Parkinson’s  disease,  we  examined 
the  expression  of  loci  linked  to  familial  Parkinson’s  disease 
[PARK7  (also  known  as  DJ-1)  and  PARK15  (also  known 
as  FBX07)]  and  ZNF746  (implicated  in  mediating  the  ef¬ 
fects  of  mutant  PARK2;  Shin  et  al.,  2011).  Furthermore, 
the  QDPR  gene  produces  an  essential  cofactor  for  dopa¬ 
mine  biosynthesis  and  was  also  evaluated.  These 
Parkinson’s  disease-linked  transcripts  were  similarly  ex¬ 
pressed  in  cases  and  controls  with  no  or  minimal  fold- 
changes  of  <5%  and  non-significant  P-values  [although 
one  of  five  PARK15/FBX07  isoforms  tested  (FBX07- 
001)  showed  a  marginal  trend  with  P  =  0.06].  For  example, 
mean  counts  for  PARK7  mRNAs  were  1144  in  de  novo 
cases  and  1160  in  controls  (1.5%  change,  P  =  0.4;  Fig.  3C). 
Counts  (Fig.  3D  and  E)  and  geometric  mean  (Fig.  3F)  of  the 
six  endogenous  reference  genes  (used  to  normalize  for  input 
RNA)  were  nearly  identical  in  cases  and  controls.  Cases 
and  controls  did  not  significantly  differ  in  white  cells,  red 
cells,  and  platelets  counts  (Fig.  3G-I). 

Individuals  with  long-3 ’UTR-SNCA  transcript  counts  in 
the  lowest  quartile  of  values  had  an  OR  >  2  for 
Parkinson’s  disease  compared  to  individuals  in  the  highest 
quartile  of  values  in  both  the  unadjusted  analysis  and  ana¬ 
lyses  adjusted  for  white  blood  cell  count,  the  only  co variate 
retained  in  a  step-wise  backward  elimination  procedure 
(Table  3). 

Risk  of  Parkinson’s  disease  associated 
with  relative  SNCA  transcript 
abundance 

Clinically  useful  biological  markers  of  complex  diseases  can 
have  modest  areas  under  the  curve  (AUC)  and  a  wide  range 
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Figure  3  PPMI:  disease-relevant  SNCA  isoforms  are  already  reduced  near  disease  onset.  (A  and  B)  SNCA  isoforms  were  probed  on 
a  highly  robust  digital  expression  platform  in  202  cases  with  de  novo  motor  Parkinson’s  disease  (PD),  with  less  than  2  years  of  disease  and  with 
confirmed  loss  of  dopaminergic  substantia  nigra  terminals  by  DAT  imaging  and  138  age-  and  sex-matched  controls  (HC)  without  neurological 
disease  and  normal  DAT  imaging  participating  in  PPMI.  Unadjusted  counts  of  disease-relevant  SNCA  transcript  isoforms  with  long  3’  UTR  (A)  or 
skipping  exon  5  (B),  were  reduced  by  27%  and  19%,  respectively,  in  de  novo  cases  compared  to  controls  (P  =  0.007  and  0.04,  respectively).  (C)  By 
contrast  mean  counts  for  the  gene  PARK7  (DJ-I)  mutated  in  rare  autosomal  recessive  Parkinson’s  disease  were  not  significantly  different  in  de  novo 
cases  with  sporadic  Parkinson’s  disease  compared  to  controls  (difference  of  1.5%).  Box  plots  visualize  first,  third  quartiles  and  means;  the  ends  of 
the  whiskers  represent  the  lowest  (or  highest)  value  still  within  1 . 5-times  the  interquartile  range.  (D-F)  Reference  genes  were  stably  expressed  in 
blood  cells  of  cases  compared  to  controls.  Counts  of  the  six  endogenous  reference  genes  used  in  PPMI  (D  and  E)  were  virtually  identical  in  the 
202  cases  and  I  38  controls.  Mean  counts  (standard  error)  are  shown  for  cases  (orange  bars)  and  controls  (grey  bars).  (F)  The  geometric  mean  of 
these  six  endogenous  reference  genes  was  used  to  normalize  for  input  RNA.  It  was  nearly  identical  in  cases  and  controls.  (G-l)  White  (WBC)  and 
red  (RBC)  blood  cell,  and  platelet  counts  did  not  differ  significantly  between  cases  and  controls.  Bar  graphs  show  means  and  standard  errors. 


of  values  that  overlap  in  subjects  with  a  clinical  disease  and 
in  those  without  it  (Manolio,  2003).  Consistent  with  this 
notion,  SNCA  transcripts  showed  AUCs  for  Parkinson’s 
disease  in  the  range  of  a  possible  risk  marker.  The  AUC 
for  SNCA  transcripts  as  a  predictor  of  Parkinson’s  disease 
was  0.6  in  HBS,  ranged  from  0.63  to  0.66  in  PROBE  (for 
quantitative  PCR  and  microarray  probes,  respectively),  and 
was  0.58  in  PPMI. 


A  meta-analysis  was  made  possible  by  the  similar  collec¬ 
tion  protocols  between  the  cohorts  and  allowed  for  power¬ 
fully  combining  the  three  studies  to  estimate  the  association 
between  SNCA  transcript  abundance  and  risk  of  early-stage 
Parkinson’s  disease.  It  increased  the  sample  size  to  500 
cases  and  363  controls  (total  of  863).  SNCA  variables  esti¬ 
mated  on  quantitative  PCR  and  digital  expression  plat¬ 
forms  were  standardized  and  ranked.  The  covariates  of 
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Summary  OR  [95%  Cl] 

All  studies  2.45  [1 .65,3.64] 

Study  specific  ORs 
HBS  2.31  [1.31,4.1] 

PROBE  4.50  [1.30,15] 

PPMI  2.30(1.20,4.3] 


OR  (lowest  quartile  versus  highest  quartile) 


Figure  4  Risk  of  Parkinson’s  is  associated  with  relative 
SNCA  transcript  abundance.  Individuals  with  SNCA  transcript 
levels  in  the  lowest  quartile  of  values  had  a  summary  odds  ratio  for 
Parkinson’s  disease  of  2.45  (1.65,  3.64)  compared  to  individuals  with 
levels  in  the  highest  quartile  in  a  meta-analysis  across  the  three 
studies.  In  the  Forrest  plot  squares  represent  the  estimate  of  effect, 
i.e.  the  OR  for  each  individual  study,  the  size  of  the  square  cor¬ 
responds  to  the  size  of  the  study,  and  bars  around  the  square  rep¬ 
resent  the  95%  Cl  of  the  odds  ratio.  The  diamond  indicates  the 
summary  estimate;  the  width  (the  horizontal  tips)  of  the  diamond 
represents  the  95%  Cl  of  the  summary  OR. 


white  and  red  blood  cell  count,  gender,  and  age  were  ad¬ 
justed  for  in  the  meta-analysis.  For  PPMI  we  first  conser¬ 
vatively  included  expression  measures  for  the  generic  probe 
(E3E4-SNCA)  that  targets  most  SNCA  transcripts.  In  a 
second  iteration,  we  included  the  specific  probe  targeting 
long  3’UTR-SNCA.  In  both  scenarios  the  association  be¬ 
tween  SNCA  blood  transcripts  and  early  Parkinson’s  dis¬ 
ease  was  highly  significant  with  P  =  0.0001  and 
P  <  0.0001,  respectively.  Overall,  individuals  with  SNCA 
transcript  levels  in  the  lowest  quartile  had  a  highly  signifi¬ 
cant  OR  for  Parkinson’s  disease  of  2.18  (96%  Cl,  1.47- 
3.22)  and  2.45  (96%  Cl,  1.65-3.64;  Fig.  4)  for  each  of  the 
two  scenarios,  respectively.  Cochran’s  Q  statistic  equalled 
2.67  ( P  =  0.26),  giving  no  evidence  of  heterogeneity  among 
the  three  studies. 

Taken  together  these  results  indicate  that  reduced  SNCA 
mRNA  levels  in  circulating  cells  are  associated  with 
Parkinson’s  disease,  even  near  disease  onset  and  in  patients 
with  DAT-imaging  supported  loss  of  dopaminergic  nigral 
projections.  Exploration  of  specific  transcript  isoforms 
using  digital  transcript  counting  indicated  that  SNCA  tran¬ 
scripts  containing  a  long  3’UTR  may  be  particularly  rele¬ 
vant  for  this  association. 


SNCA  transcript  abundance  predicts 
cognitive  decline 

Predictors  of  cognitive  decline  would  be  valuable  for  strat¬ 
ifying  clinical  trials  and  for  advancing  clinical  care  (Chen- 
Plotkin,  2014).  We  have  tracked  Mini-Mental  State 
Examination  (MMSE)  scores  in  the  HBS  and  the  PROBE 
populations  during  multiple  longitudinal  in-person  follow¬ 
up  visits  for  ~5  years.  To  evaluate  whether  SNCA 
transcript  levels  measured  at  enrolment  can  longitudinally 


predict  cognitive  prognosis  in  Parkinson’s  disease,  we  ana¬ 
lysed  the  222  patients  with  Parkinson’s  disease  that  were 
longitudinally  followed  in  the  HBS.  In  HBS,  MMSE  scores 
were  obtained  during  in-person  study  visits  at  enrolment, 
after  12,  24,  and  60  months  in  the  study.  There  was  overall 
a  significant  relation  between  SNCA  transcript  abundance 
at  enrolment  and  MMSE  scores  during  the  longitudinal 
follow-up  period  with  P  =  0.0007.  We  then  tested  whether 
low  versus  high  SNCA  expression  status  measured  at  en¬ 
rolment  could  predict  longitudinal  cognitive  decline  in  these 
patients.  The  25%  of  patients  with  Parkinson’s  disease  with 
the  highest  SNCA  transcript  levels  at  enrolment  (n  =  55; 
termed  henceforth  ‘SNCA  high  expressors’)  and  the  25% 
of  patients  with  the  lowest  SNCA  transcript  levels  at  enrol¬ 
ment  (n  =  55;  termed  ‘SNCA  low  expressors’)  were  com¬ 
pared  (Fig.  5).  Notably,  at  enrolment  high  and  low 
SNCA  expressors  had  nearly  identical  mean  (SD)  MMSE 
scores  of  28.37  (1.95)  versus  28.53  (2.28)  ( P  =  0.3). 
Moreover,  the  two  groups  showed  no  statistically  signifi¬ 
cant  differences  in  other  clinical  characteristics  at  enrol¬ 
ment,  including  gender,  age,  disease  duration,  total 
Unified  Parkinson’s  Disease  Rating  Scale  (UPDRS)  score, 
Hoehn  and  Yahr  stage,  medications,  and  years  of  education 
(Supplementary  Table  3).  Notably,  low  SNCA  expressors 
showed  accelerated  longitudinal  cognitive  decline  compared 
to  high  expressors  with  P  =  0.01  in  the  longitudinal  mixed 
random  and  fixed  effects  model  analysis  (Locascio  and  Atri, 
2011)  adjusting  for  age,  gender,  disease  duration  upon  en¬ 
rolment,  and  years  of  education  (Fig.  5A). 

We  then  asked  whether  low  and  high  expressor  status 
can  also  predict  cognitive  decline  in  PROBE.  Prospective 
longitudinal  clinical  data  were  available  for  65  of  the  76 
patients  in  PROBE  with  relative  SNCA  abundance  mea¬ 
sured  by  quantitative  PCR  at  enrolment.  Again,  the  25% 
of  patients  with  Parkinson’s  disease  with  the  highest  SNCA 
transcript  levels  at  enrolment  into  PROBE  (« =  16  in 
PROBE)  and  the  25%  of  patients  with  the  lowest  SNCA 
transcript  levels  at  enrolment  («=16  in  PROBE)  were 
tested.  High  and  low  SNCA  expressors  had  similar 
MMSE  scores  of  29.19  (0.98)  versus  28.44  (2.78)  at  enrol¬ 
ment  (P  =  0.95)  (Supplementary  Table  2).  Mixed  effects 
analysis  adjusting  for  the  covariates  of  age,  gender,  disease 
duration  upon  enrolment,  and  years  of  education  indicated 
a  similar  trend  as  observed  in  the  larger  HBS  population, 
but  this  did  not  reach  statistical  significance,  likely  because 
of  insufficient  power  (Fig.  5B).  PPMI  was  not  included  in 
the  longitudinal  analysis  because  the  multi-year  longitu¬ 
dinal  follow-up  data  needed  are  not  yet  available.  Finally, 
to  increase  power,  we  performed  a  mixed  random  and 
fixed  effects  meta-analysis  across  the  low  and  high  expres¬ 
sors  from  each  of  the  HBS  and  PROBE  populations.  The 
combined  analysis  robustly  confirmed  SNCA  expression 
status  as  predictor  of  longitudinal  cognitive  decline  in 
Parkinson’s  disease  with  P  =  0.005  after  adjusting  for  per¬ 
tinent  covariates  and  for  study  (Fig.  5C).  Similar  analyses 
were  performed  to  evaluate  whether  SNCA  expression  can 
also  predict  decline  in  longitudinal  total  UPDRS  scores. 
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Figure  5  SNCA  transcript  abundance  predicts  cognitive  decline  in  patients  with  Parkinson’s  disease  during  longitudinal 
follow-up.  (A)  The  25%  patients  with  Parkinson’s  disease  with  the  highest  SNCA  transcript  levels  at  enrolment  into  HBS  (n  =  55;  ‘SNCA  high 
expressors’)  and  the  25%  of  patients  with  the  lowest  SNCA  transcript  levels  at  enrolment  in  HBS  (n  =  55;  ‘low  SNCA  expressors’)  were  compared. 
Illustrative  mean  scores  on  the  MMSE  across  time  predicted  from  the  estimated  fixed  effect  parameters  in  the  mixed  random  and  fixed  effects 
model  analysis  (Locascio  and  Atri,  2011)  are  shown  for  SNCA  high  expressors  and  SNCA  low  expressors.  Low  SNCA  expressors  showed 
accelerated  longitudinal  cognitive  decline  compared  to  high  expressors  with  P  =  0.01  adjusting  for  age,  gender,  disease  duration  upon  enrolment, 
and  years  of  education.  (B)  Mixed  effects  model  analysis  for  the  25%  SNCA  high  expressors  (n  =  16)  versus  the  25%  low  expressors  in  PROBE 
(n  =  16)  adjusting  for  covariates  indicated  a  similar,  but  non-significant  trend.  (C)  In  the  combined  analysis,  SNCA  expression  status  was  robustly 
confirmed  as  a  predictor  of  longitudinal  cognitive  decline  in  Parkinson’s  disease  with  P  =  0.005  after  adjusting  for  pertinent  covariates  and  study. 
Illustrative  mean  scores  on  the  MMSE  across  time  predicted  by  the  fitted  model  in  the  longitudinal  analysis  for  Parkinson’s  disease  patients  in  the 
lowest  quartile  of  SNCA  expression  levels  are  shown  as  red  triangles;  values  for  Parkinson’s  disease  patients  in  the  highest  quartile  of  SNCA 
expression  levels  are  represented  as  blue  circles  (solid  lines  indicate  predicted  values  for  subjects  with  a  mean  value  of  disease  duration  at 
enrolment;  large-dashed  lines  indicate  those  for  individuals  with  I  SD  longer  disease  duration  at  enrolment;  and  short-dashed  lines  indicate  those 
for  subjects  with  I  SD  shorter  disease  duration  at  enrolment). 


These  analyses  were  inconclusive  and  will  require  further 
evaluation  in  larger,  longitudinally  followed  patient 
populations. 

Discussion 

We  show  here  that  SNCA  transcript  counts  in  circulating 
blood  cells  are  reduced  in  early  Parkinson’s  disease — even 
in  the  landmark  PPMI  study  that  enrolled  patients  before 
they  met  standard  clinical  diagnostic  criteria.  The  associ¬ 
ation  between  SNCA  expression  and  Parkinson’s  disease 
was  generalizable  across  platforms  and  across  populations 
(Table  2  and  Figs  1—3).  It  was  observed  in  studies  from  a 
single  health  care  system  (HBS),  from  22  USA  sites 
(PROBE),  and  from  24  USA  and  international  sites 
(PPMI).  This  association  remained  significant  when  ad¬ 
justed  for  confounding  variables  such  as  age,  gender, 
white  and  red  blood  cell  counts,  Parkinson’s  disease  medi¬ 
cations,  or  sample  handling. 

Biological  markers  of  processes  that  lead  to  complex  dis¬ 
eases  such  as  Parkinson’s  disease  are  not  simply  present  or 
absent,  but  have  a  wide  range  of  values  that  overlap  in 
subjects  with  a  clinical  disease  and  in  those  without  it 
(Manolio,  2003).  For  example,  total  cholesterol  has 
modest  specificity  and  sensitivity  for  predicting  cardiovas¬ 
cular  disease  (with  an  area  under  the  receiver  operating 


curve  of  0.59  in  the  Women’s  Health  Study;  Ridker 
et  al.,  2000) — well  short  of  the  >90%  specificity  required 
for  a  bona  fide  diagnostic  test.  Nonetheless,  biological  mar¬ 
kers  of  complex  diseases  can  have  powerful  clinical  appli¬ 
cations.  Cholesterol  metabolism-related  biomarkers  have 
transformed  cardiology.  They  enable  early  intervention 
with  cholesterol-lowering  medications  because  of  the  fa¬ 
vourable  risk/benefit  ratio  (low  side-effect  profile  combined 
with  clear  benefits  for  reducing  debilitating  cardiovascular 
events).  Levels  of  these  commonly  used  biomarkers  are  also 
valuable  in  tracking  therapeutic  response  and  patient  strati¬ 
fication.  Thus,  if  confirmed  in  prospective  studies,  SNCA 
blood  transcripts  could  be  developed  into  future  markers  of 
risk,  of  patient  stratification  for  clinical  trials,  and  of  re¬ 
sponse  to  novel  therapeutics  (particularly  those  targeting  oc- 
synuclein  production  or  clearance). 

We  have  begun  to  explore  a  clinical  application  for  this 
biological  marker  as  a  predictor  of  cognitive  decline. 
Cognitive  decline  is  one  of  the  most  debilitating  manifest¬ 
ations  of  disease  progression  in  Parkinson’s  and  is  closely 
correlated  with  the  presence  of  a-synuclein  protein-positive 
Lewy  body  burden  throughout  the  neocortex  (Braak  et  al., 
2002).  Predictors  of  cognitive  decline  would  be  valuable  for 
stratifying  clinical  trials  and  for  advancing  clinical  care 
(Chen-Plotkin,  2014).  Several  previous  studies  have  exam¬ 
ined  the  association  between  cognitive  decline  and  biolo¬ 
gical  markers  in  cross-sectional  studies  (Mollenhauer  et  al., 
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2014)  or  with  a  single  longitudinal  follow-up  after  1-2 
years  (Siderowf  et  al.,  2010;  Chen-Plotkin  et  al.,  2011). 
Longitudinal  studies  with  repeated  measurements  provide 
more  independent  information  than  a  single  measurement 
(Hedeker  and  Gibbons,  2006;  Locascio  and  Atri,  2011).  In 
our  study  low  SNCA  transcript  abundance  at  enrolment 
predicted  cognitive  decline  in  patients  with  Parkinson’s  dis¬ 
ease  that  were  cognitively  reassessed  during  multiple  longi¬ 
tudinal  visits  over  up  to  5  years  of  follow-up  with  a  P-value 
of  0.005  in  the  combined  analysis  of  HBS  and  PROBE 
studies. 

Beyond  these  translational  implications,  the  association 
here  observed  offers  an  immediate  clue  towards  unravelling 
the  biological  function  of  a-synuclein  in  health  and  disease. 
a-Synuclein  had  been  thought  of  as  a  neuron-specific  gene 
for  decades,  but  this  view  was  turned  on  its  head  with  the 
discovery  of  high  levels  of  a-synuclein  present  in  blood 
cells,  particularly  in  reticulocytes  (circulating  immature 
red  blood  cells)  and  during  terminal  erythroid  differenti¬ 
ation  and  in  megakaryocytes  (Scherzer  et  al.,  2008; 
Maitta  et  al.,  2011),  with  low  or  no  expression  in  other 
tissues  (Maroteaux  et  al.,  1988).  In  erythroid  cells  SNCA 
expression  is  regulated  by  the  GATA-1  transcription  factor 
in  synchrony  with  machinery  essential  for  incorporating 
iron  into  heme  within  mitochondria  (Scherzer  et  al., 
2008)  thereby  pointing  at  a  biological  link  between  three 
nodes  of  Parkinson’s  disease:  a-synuclein,  mitochondria 
(Zheng  et  al.,  2010),  and  iron  (Rhodes  and  Ritz,  2008). 
Our  study  suggests  that  this  link  might  be  perturbed  in 
early  disease  stages.  Alternative  a-synuclein  transcript  iso¬ 
form  usage  has  been  suggested  as  a  potential  convergent 
mechanism  in  Parkinson’s  disease  pathology  (Rhinn  et  al., 
2012).  3’UTR  ends  of  mRNAs  are  critical  for  targeting 
transcripts  to  specific  subcellular  compartments  and  for 
translational  control  (Andreassi  and  Riccio,  2009). 
Specific  transcript  isoforms  of  SNCA  with  an  extended 
3’UTR  impact  accumulation  of  a-synuclein  protein,  which 
appears  redirected  away  from  synaptic  terminals  and  to¬ 
wards  mitochondria  (Rhinn  et  al.,  2012).  This  might  be 
relevant  to  disease  processes  as  mitochondrial  dysfunction 
is  implicated  in  the  onset  of  Parkinson’s  disease  neuropath¬ 
ology  (Zheng  et  al.,  2010).  By  contrast,  deletion  of  exon  5 
of  the  SNCA  gene  results  in  the  translation  of  a  truncated 
112  amino  acid  protein  that  is  prone  to  aggregation  and 
increases  seeding  efficiency  (Murray  et  al.,  2003). 

In  sporadic  Parkinson’s  disease,  a-synuclein  protein  (and 
iron)  accumulates  in  the  brain  and  peripherally  a-synuclein 
aggregates  are  found  in  skin,  glands,  intestines  and  epicar- 
dium  (reviewed  in  Jellinger,  2011).  In  a  few  dozen  patients 
a  triplication  of  the  SNCA  locus  causes  monogenic  familial 
Parkinson’s  disease.  The  direction  of  the  change  observed 
here  was  therefore  surprising,  even  paradoxical,  at  first 
glance.  How  can  it  be  that  SNCA  blood  transcript  levels 
are  consistently  reduced  in  each  of  the  three  studies?  Ample 
precedent  exists  for  paradoxical  changes  in  peripheral  mar¬ 
kers  of  neurological  diseases.  In  Parkinson’s  disease,  extra¬ 
cellular  a-synuclein  protein  levels  are  reduced  in  CSF  for 


reasons  unknown  (Mollenhauer  et  al.,  2011).  In  Wilson’s 
disease,  copper  accumulates  in  the  basal  ganglia,  but  serum 
levels  are  paradoxically  low.  In  Alzheimer’s  disease,  amyl¬ 
oid^!  levels  are  reduced  in  CSF  (Hansson  et  al.,  2006). 
We  hypothesize  that  intracellular  SNCA  transcript  levels  in 
clinically  manifest,  sporadic  Parkinson’s  disease  may  be 
reduced  as  a  consequence  of  intracellular  a-synuclein  pro¬ 
tein  accumulation  (due  to  impaired  clearance  or  other 
mechanisms)  that  leads  to  a  feedback  repression  of  tran¬ 
scription — in  a  homeostatic  effort  to  ameliorate  the  ‘log 
jam’  of  excessive  a-synuclein  protein  products.  In  an  alter¬ 
nate  view,  because  SNCA  is  so  highly  expressed  in  reticu¬ 
locytes,  reduced  SNCA  expression  could  be  a  marker  of 
generally  abnormal  reticulocyte  biogenesis  or  function  in 
early-stage  Parkinson’s  disease.  We  performed  association 
studies  that  do  not  imply  causality  and  that  were  not  de¬ 
signed  to  distinguish  between  these  alternative  hypotheses. 
However,  neither  ours  nor  previous  studies  (Kasten  et  al., 
2010)  found  evidence  for  an  association  between  anaemia 
and  Parkinson’s  disease.  By  contrast,  anaemia  during  early 
life,  preceding  disease  onset  by  up  to  20-29  years  (Savica 
et  al.,  2009),  and  multiple  recent  blood  donations  have 
been  correlated  to  increased  risk  of  Parkinson’s  disease 
(Logroscino  et  al.,  2006). 

The  ease  and  non-invasiveness  of  an  a-synuclein  blood  test 
is  appealing.  Previous  attempts  to  search  for  a-synuclein 
blood  tests  centred  on  a-synuclein  plasma  protein,  involved 
small  cohorts,  and  were  inconclusive  (for  review  see  Chahine 
et  al.,  2014).  That  approach  poses  considerable  technical 
challenges  as  plasma  can  be  easily  contaminated  with  a- 
synuclein  released  from  erythrocytes  ruptured  during 
sample  transport  and  centrifugation.  Our  studies  leapfrogged 
this  technical  challenge.  Patient  blood  was  directly  collected 
into  an  FDA-approved  system  for  preserving  ‘intracellular 
RNA  from  whole  blood  for  RT-PCR  used  in  molecular  diag¬ 
nostic  testing’  (without  a  need  for  plasma  separation)  where 
chemicals  immediately  stabilize  the  in  vivo  gene  transcription 
profile  by  reducing  in  vitro  RNA  degradation  and  minimiz¬ 
ing  gene  induction  (Pahl  and  Brune,  2002a,  b;  Rainen  et  al., 
2002;  Kim  et  al,  2014).  The  HBS,  PROBE  and  PPMI  stu¬ 
dies  were  purposefully  designed  to  minimize  the  threats  of 
bias  from  sample  collection,  processing  and  storage 
(Ransohoff,  2004).  Case  and  control  samples  for  HBS  and 
PPMI  were  collected,  processed  and  analysed  in  parallel 
using  standardized  procedures  for  each  step.  In  contrast  to 
other  studies,  RNA  integrity,  complete  blood  counts,  and 
several  processing  and  quality  indicators  were  carefully  mea¬ 
sured  for  all  three  studies. 

Although  each  of  the  three  biomarker  studies  here  pre¬ 
sented  has  limitations  when  taken  on  its  own,  these  indi¬ 
vidual  constraints  are  compensated  for  by  the  collective 
strengths.  For  example,  while  the  PROBE  and  HBS  study 
could  not  completely  rule  out  an  undue  influence  of 
Parkinson’s  disease  medications  on  SNCA  expression,  this 
question  was  fully  addressed  by  exclusively  recruiting  de 
novo  cases  in  PPMI.  We  clearly  show  that  this  is  not 
simply  a  marker  of  sick  patients;  instead,  the  association 
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is  already  detectable  in  individuals  who  are  very  early  in 
the  disease  course  and  highly  functional.  Clinical  diagnostic 
certainty  was  enhanced  through  DAT  neuroimaging. 

Beyond  revealing  an  association  between  lowered  SNCA 
expression  and  Parkinson’s  disease,  this  study  presents  a 
generally  useful  resource  of  three  purpose-designed  bio¬ 
banks  and  linked  clinical  data.  This  platform  of  biobanks 
can  now  be  used  for  accelerating  the  exploration  of  nearly 
any  biofluid  marker  for  Parkinson’s  disease  by  decreasing 
the  time  from  discovery  to  replication  from  multiple  years 
down  to  the  assay  run-time. 
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